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Abstract 

A polynomial depth quantum circuit effects, by definition a poly- 
local unitary transformation of tensor product state space. It is a rea- 
sonable belief [Fy][L][FKW] that, at a fine scale, these are precisely 
the transformations which will be available from physics to solve com- 
putational problems. The poly-locality of discrete Fourier transform 
on cyclic groups is at the heart of Shor's factoring algorithm. We de- 
scribe a class of poly-local transformations, which include the discrete 
orthogonal wavelet transforms, in the hope that these may be helpful 
in constructing new quantum algorithms. We also observe that even 
a rather mild violation of poly-locality leads to a model without one- 
way functions, giving further evidence that poly-locality is an essential 
concept. 

1 Introduction 

Feynman [Fy] described how, in principle, a finite dimensional quantum me- 
chanical system could be made to evolve so as to execute a sequence of 
"quantum gates." By a quantum gate, we mean one of the several rather 
standard unitary operators applied to a small (usually 1, 2, or 3) number 
of tensor factors (and the identity on the remaining factors) within a larger 
tensor product of factors of bounded dimension, often taken to be (C^)*^". 
For example: NOT, Controlled NOT, and Controlled i— phase is known to 
be a complete basis [Ki]. 

^Electronic mail: michaelf@microsoft.com 
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Lloyd [L] made the converse explicit. Let if be a poly-local Hamiltonian, 

L 

H — ^if^ is a sum of L < polynomial (problem size) terms H^, each the 

(.=1 

identity except for a constant number of tensor factors. The Hamiltonian H 

M 

exponentiates to U = e^'^**^ which is well approximated by U' = J|(gate)i, 

i=l 

where M < constant t • - ■ L"^ and e > is the allowed (operator norm) dis- 
crepancy between U and U'. The poly-local assumption on H is very natural 
for most real world quantum mechanical systems and is often summarized by 
the axiom: "physics is local." For example, in quantum field theory interac- 
tions tensors with more than four indices are generally not renormalizable. In 
more exotic contexts, such as, fractional quantum Hall systems [W] topolog- 
ical models have been proposed for which the Hamiltonian acting on internal 
degrees of freedom vanishes identically H = 0, yet after a topological event, 
such as a braiding b is completed, the internal state is transformed by U(6). 
For systems of this type (called: a unitary topological modular functor) a 
distinct argument was provided [FKW] that U(6) is poly-local. There seem 
to be few remaining candidates - perhaps quantum gravity - for a physical 
system having a finite dimensional state space whose evolution cannot be well 
approximated by a poly-local transformation U. This reinforces confidence 
in the quantum circuit model QCM [Ki], and motivates the examination of 
(families of) transformations having a poly-local description. Any poly-local 
unitary transform is available as a "subroutine" on a quantum computer, 
but it will not generally be a simple matter to recognize a transformation as 
lying in (or efficiently approximated by) that class. 

While a simple volume argument shows that only a tiny fraction of the 
unitary group U(2") can be approximated (up to phase) by poly-local trans- 
formations, it is not known how to give concrete examples which cannot be 
thus approximated. The situation is reminiscent of the fact that transcen- 
dental numbers are easily seen to predominate, but less easy to identify. To 
add to the puzzle, in most contexts, we will be free to add ancilli to create an 
inclusion v ^ v ® |0...0>,i7c H'^ of one Hilbert space into a larger one. 
It seems likely that there may be an operator T : H — > H which is not well 
approximated by poly-local operators but its extension T id : — > 
is. In this regard, Michael Nielsen has pointed out (private communication 
and Chpt. 6 [CN]) that the exponential of any Hamiltonian which is a sum of 
polynomially many product terms can, with the help of ancilli, be efficiently 
simulated on a quantum computer. There is no requirement on the Individ- 
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ual product terms except that they be products, i.e. a tensors of operators 
each acting on one (or up to log many) qubit(s). 

To appreciate the importance of a poly-local description consider the 
FFT. The reader familiar with the Fast Fourier Transform may be sur- 
prised that it has a virtue in the quantum context since it is already fast, 
0{N\ogN), classically. But on this scale, we should think of the quan- 
tum version as poly(logA^) in the sense that wc may Fourier transform 
in poly(logA^) time a data vector (/(| 1 >),..., /(| >)) (held in su- 
perposition and previously computed in poly(logA^) time) into its discrete 
Fourier transformation (/(| 1>),...,/(|A^>)). The measurement phase of 
quantum computation samples the transform according to the density of its 
L^— norm squared. In a sense, this is the best we could hope for in poly-log 
time. We could never output the entire transform in less than 0{N) time, 
instead sampling shows us its sharpest spikes. Similarly with other trans- 
forms, a poly-local description allows us to sample, according to the L^ — 
norm, that transform applied to any rapidly computable function. 

A key step in the Shor factoring algorithm is his description of the discrete 
Fourier transform F : Cm — ^ Cm for the cyclic group of order m, where m 
is a large smooth number, as a poly-local transform (The size parameter is 
logm so F must be written as a composition of poly (log m) many gates). 

We construct another class of examples by showing that cyclic periodic 
near diagonal matrices are poly-local (Thm 1). The construction is direct - 
no ancilla qubits are used. As an application, we prove (Corollary 1) that 
all the standard orthogonal discrete wavelet transforms, in particular the 
Daubechies transforms L>2„ [D], are poly-local. I thank C. Williams for point- 
ing out that he and A. Fijany previously obtained an explicit factoring of 
= Haar) and into quantum circuits [AW]. Our alternative approach has 
the usual virtues and demerits of abstraction. 

Some perspective into the limitations poly-locality imposes on the poly- 
time computational class of the QCM, BQP, can be achieved by adjoining 
oracles . Let On : (C^)®'* — ^ (C^)®'* be a "super-gate" which can be called 
upon in addition to gates to transform the computational state It is clear 
that if On encodes the answer to a difficult (or even undecidable) problem 
Pn, QCM^ will be improbable strong: If /„ : {0, 1}"-^ — > {0, 1} is a func- 
tion and A{fn) : {0,1}" — > {0,1}'' is defined by A/„(a;i, . . . , a;„_i, a;„) = 
{xi, . . . , Xn-i, fni^i, ■ ■ ■ 1 ^n-i) + ^n} ^-nd 0„ is the linear extension of A/„ 
operating on (C^)®", then we can compute /„(xi, . . . , Xn-i) by measuring the 
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last qubit of On{xi, . . . , Xn-i, 0). However, if we stipulate that On is "locally- 

poly" meaning that the entry of On be computable in poly(n) time, can 
the nonlocal structure of such On by virtue of their size alone, extend QCM? 
The following observation is evidence that the answer is "yes." 

Observation 1. For each family of bijections fn '■ {0, 1}" — > {0, 1}" which 
is computable in polynomial(n) time there is a family of oracles On with 
entries {On)i,j computable in poly{n) — time so that f^^ can be computed by 
BQP^". Simply set {On)ij = 5f(i)j- Write the hit string {yl, . . . ,yl) = 

y = y\l < J < 2". Then Ony = = y^"^^^ = f-\y), so 

applying On to {yi, . . . ,yn) and then measuring each qubit separately solves 

the inversion problem On{y{, ■ ■ ■ , yii) = {yl . . . , yl ^''^) . Thus, one-way 
functions disappear if we are permitted to adjoin locally-poly yet non-poly- 
local oracles. We have not succeeded in solving an NP complete problem 
with such an oracle. 

2 Periodic Band-Diagonal Transforms 

It is common in signal processing to encounter smoothing operators which 
when written as matrices are band diagonal. Often mathematical reasons 
suggest wrapping the interval into a circle and considering cyclically banded 
matrices M, Mjj = unless \ i — j \< h mod N,l < i,j < N. Two addi- 
tional conditions natural in signal processing are unitary, (M~^ = ikf^) and 
cyclicity (3k << N s.t. M^j = Mj+fe^ j+fe, addition of indices taken (modiV)). 
All these conditions occur in quadrature mirror filters [P] . We fix b and k and 
considering as parameter approaching infinity for studying the family of 
N X N matrices Mjy. To summarize the M^r are N x N unitary matrices, 
6— band diagonal in the cyclic sense, cyclic with period k, k\N, and 46 < A^. 
Furthermore even among different Mn, Mat/, we assume {MN)ij = {MNi)i/ji. 
If i = i' and j = j' mod k, so the matrices look locally identical. Purely for 
convenience, we also assume all A" are powers of 2, A" = 2", n > no some 
fixed constant and k = 2^, this makes the fit with qubits effortless. A fi- 
nal technical assumption is necessary to efficiently approximate even a small 
MtVo, A^o = 2"°, from gates available in the QCM. It is sufficient here [Ki] to 
require the entries Mij are algebraic numbers. This prevents some difficult 
to obtain information being encoded in the entries themselves. The theorem 
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Figure 1: White = zero, light shading = original K-periodic entries, dark 
shading = transition rows, and 1 = one. 

below asserts that the family {M^} is poly-local in the strictest sense: No 
ancilla are introduced to stabilize the problem. 

Theorem 2.1. Let {M^} be as above, there is a polynomial p so that for 
any e > there is an M'^, e close to Mn in the operator norm, so that M]y 



Proof 2.2. It is sufficient to consider = 2", for n greater than any fixed 
number, so the reader may image M^r, N comfortably large. For some 
K = 2^k, some j > 0, we will modify to Mn to a unitary K— cyclic 
matrix which agrees with except periodically for a small (order b) 
slab of rows where Mn agrees with the identity matrix and a small (order b) 
transition region between the two types of rows. 

The construction of is very similar to the truncation process given in 
[F,P] so we will not give formulas here, but simple principles from which 
may be constructed. Let V/ be the vector subspace spaned by the rows of 
Mn whose row index lies in I (We think of / as an interval with integer end 
points on the circle consisting of R/NZ). Let Wi be the subspace spanned 
by the basis vectors with indexes in /— equivalently those rows of the identity 
X A^— matrix. From the nonsingularity of unitary matrices we obtain: 
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Lemma 2.3. Whenever J contains the b— neighborhood of I, Wj C Vj and 
Vi C Wj. □ 

We apply the lemma several times below: consider an interval I of £ > 2b 
rows and a larger interval J of i rows + 2b rows containing I symmetrically. 
Let and J~ denote the intervals of length= 2b at containing the + and 
— end points of J respectively. The rows {q} of in the complement of 
/ together with the rows {s} of the identity matrix in the interval J span 
the entire space. Let {r} denote the rows of Mn with indices in and 
J~. The Gram-Schmidt process applied to {r} U {s} (The Gram-Schmidt 
process requires an ordering; work from the middle of the J interval outward) 
produces a Hermitian- orthogonal frame {t} for the space spanned by {r} U 
{s} (Note: {q} U {r} U {s} span the entire space). This process naturally 
leaves the central i — 2b rows of {s} unchanged, i.e. they remain standard 
basis vectors. The geometry of this process is pictured below. 




Figure 2 

The frame {t} consists of i standard basis vectors together with 2b vectors 
of length < 2b. Think of these 26— vector as mediating a transition from 
rows {q} of Mn to rows {s} of the identity matrix. The truncation process 
preserves unitarity and band diagonality and the band width. 

Mat is block diagonal consisting of 2""-'"^ identical blocks M (independent 
of N) down the diagonal. Thus, Mjv has a tensor decomposition Mm = 
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M id(c2) 

Define M — (Mjv)~^Mjv. Clearly the rows of Mjv agree with the identity 
matrix outside the J— intervals and Mjv is 6— band diagonal. Thus, 
can be written in a tensor product form as well: Af at consists of identical 
blocks down the diagonal of size K provided K > 2b. Thus, we may write 
Mjv = Po (M® id(c2)®n-j-f) oP~^ where P is a cyclic permutation matrix of 

{1, 2, . . . , N}. The permutation must arise since the blocks of M are off-set 
from the blocks Mat by a translation of y. 

Because we assumed the entries of Mn to be algebraic so are the entries 
of M and M. It is well-known that algebraic numbers can be efficiently 
computed using Newton's method so that refining the numerical accuracy of 
an algebraic a to up to a factor of (1 + e) requires only constant(Q;)- poly 
log(i) computational steps. Now by a theorem of [Ki] for approximating a 
unitary transformation in a fixed dimension as a product of gates, we find 

matrices M' and M , \M — m'\ < |, and \M — M \ < | using the operator 

norms, with m' and M a product of poly log(^) elementary gates. 

The cyclic permutation P and its inverse P~^ are, of course, poly(n) = 
poly(log A?") time computable classically. Writing \x >e {1,2,..., N} in an 
initial register and |0 > in an equal large (size = n, N = 2") register of ancilli 
a well-known sequence of operations effects the transformation: 

Pi(\x > 0\O >) compute? [\x > ®\0 + P{x) >) flip {\P{x) > ®\x >) 

computeP^ {\P{x) > ®\x + P~^P{x) >) 
= \\P{x) >(g)|0 > ) 

(where -|- denotes bit wise mod 2 sum in the register). More directly, it is 
possible to build size 0{n^) circuits from "NOT, "Controlled NOT," and 
"swap" gates (Sec Section 2.4 [AW].), which use no ancilli and implement 
addition of y modulo N . For example, a circuit implementing -\-2 mod (8) 
is shown below. 
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2's bit 



3's bit 
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Figure 3 



Thus, all the 4 terms of an e— approximation M'j^ may be written as a product 

of poly (n, log i) many gates: Mn ^ M'j^ = (M' (g) id) o P o (M' id) o p-^. 
□ 



3 Wavelets 

We define a discrete wavelet transform as the result of the pyramid algorithm 
[M] - applied to a band diagonal quadrature mirror filter. Daubachies' [D] 
original basis is recovered by choosing a quadrature mirror filter whose 
differencing rows have a maximal number of vanishing moments (consistent 
with orthogonality relations). For example, the simplest Daubachies wavelet 
Daub4 is built from an orthogonal matrix(and mirror filter): 
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■2" = N 



satisfying moment conditions: 

C3-C2 + Ci-Co = and OC3 - IC2 + 2Ci - 3Co = 0. 
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First let us explain in algorithmic terms how the wavelet coefficients of 

an data (column) vector X = {xi, . . . ,X]\f) arc computed starting from a 
filter Mn as above (compare [P]). The even components of M^X are re- 
garded as "smoothed" and these are set aside for further processing. The 
odd components of M^X are the finest scale wavelet coefficients. To con- 
tinue processing apply the matrix identical in description to Mjv, but of half 
the size to the even output of Mj^X : M7V/2 (even(MjvX)). Again set the 
even output aside: the odd output are the wavelet coefficients at the next- 
to-smallcst scale. Now apply Mn/4 I even (Mjv/2(evenMjvX))), etc... The 

process must terminate when M becomes so small that wrapping around of 
rows threatens unitarity. For Daub4 the last M will be a 4 x 4 matrix. The 
wavelet coefficients of X, usually written as X integrated against a function 
ijj, are the output of the final differencing (proceeded, possible, by several 
smoothings). The "mother" or "scaling" coefficients will be the output of 
successive smoothings (i.e. even rows). These are traditionally written as X 
integrated against a function 0; there will be two of these for Daub4. 

The pyramid algorithm consist of only \og{N) many applications of a 
filter matrix M, obeying the hypotheses of the theorem, and some easily 
computed permutations needed to sort out the even from odd rows between 
applications of M. Thus from the theorem, we have an evident corollary, 
which should be compared with the constructions of [AW] and [Kl]. 

Corollary 1. Consider a poly-time computable function of f from the inte- 
gers [1, . . . ,N] into a fixed basis of a vector space V of dimension = d. The 
function f is recorded by a vector VfeV^ . For any discrete (algebraic) wavelet 
transform T applied to f, there is a quantum circuit, without ancilla, of length 
poly(^\ogN, log ^) which yields a state space vector T' (v f)eV ^ which is within 
e > of the exact transform T[vf). The components ofT'iVj) can be sam- 
pled by von Neumann measurement according to the density \componentij\'^ , 
1 <i < N, 1 < j <d. 

The proceeding recipe, in so far as repeated smoothing is applied, sends a 
"Dirac function" 6 toward a fixed point of the iteration 4>{x) — > Sc„0(2a; — 
n). Such a fixed function is called the scahng function of the filter M. 
Wavelet coefficients (in the hmit) correspond to integrations against ■0(2*^x — 
n) where iIj{x) = H{—l)^Ci_k(f){2x — k). Whereas, the Fourier transform effi- 
ciently extracts information about periodicity (hence its relevance to divisibil- 
ity and factoring questions) the wavelet transforms (discrete or continuous) 
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extract qualitatively different information. It is less easy to say precisely 
what combinatorial structures wavelets are best suited to detecting, but the 
answer must relate to phenomenon with a certain amount of both spacial 
and frequency localization. Some studies suggest fractal behaviors, Haus- 
dorff dimension, and the like are readily observed in wavelet bases. Thinking 
democratically, one should not prefer either Fourier or wavelet bases, but seek 
different problems which can be solved by sampling the density of one trans- 
form or the other. So far on the Fourier side there is discrete log, factoring, 
and the abelian stabilizer problem [S] [Ki] - essentially similar applications of 
that transform, and no interesting quantum applications, yet known, for the 
wavelet transforms. 

4 Continuum Versus Vectorized Transforms 

Both Shors' [S] use of the Fourier transform and our treatment of wavelet 
transforms concerns the "vectorized" transform: The function values of x"' 
mod (n) in Shor's work and the values of / mod(d) in our corollary 1 are 
treated as independent basis vectors in an abstract vector space. The trans- 
forms do not treat similar function values similarly, but rather place each 
possible outcome of f{t) into a separate bin. Thus, there is no notion of the 
continuum limit of these transforms as presently constituted and they should 
not be confused with their continuous relatives in signal processing. The 
property of a function / on the integers having period — dis invariant under 
arbitrary permutation (or vectorization) of the target. This fact is crucial to 
the efficacy of the Fourier transform in Shor's algorithm. It is more difficult 
to see what portion of the information captured by a wavelet transform is 
invariant under target permutation or vectorization. 

Although it seems that a continuous - as opposed to vectorized - Fourier 
transform is not as useful for factoring, their may be other quantum appli- 
cations for the continuous version. In the case of wavelets, it is more likely 
that continuous rather than vectorized transforms will be important. In the 
following paragraph, we explain how to use phase coding of states to build ef- 
ficient continuous versions of any transform T (such as Fourier or wavelet) for 
which there is an efficient quantum circuit implementation of the vectorized 
version. 

Let F : [1, . . . , A^] — > [1, . . . ,d] be a funtion and denote t E [1, . . . , A^] 
as the variable. For 1 < x < ci, we may define fx{t) = if /(t) 7^ x, and 
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d 

fx{t) = 1 if f{t) = x; f{t) = S fx{t). A "vectorized" unitary transform 

x=l 

obeys: 

/ N ^ d N y x_ 

T( S |t > ®|/(i)>) = S S T(/^)t)|t>®|x> 

\t=l / a:=l J=i V / 

where 1 < t < iV is the transform variable, and states are written without 
numerical normalization factors. If we pick D » d and let w = e^'^*/-^, 
the set of complex numbers S = {w^\l < i < d} is a short discretized arc 
of a circle and is a rather good affine model for A = [1, . . . under the 

bijection 6 ^ g2m5/D_ After linear rescaling the quasi-isometric distortion of 
b is (A map g : X — > Y between metric spaces is said to have 

quasi- isometric distortion < K for some K >0 iS: 

d{g{xi),g{x2)) < d{xi, X2) < (1 + K)d{g{xi)g{x2)). 



1 + K 



We will neglect this distortion and code the values of a bounded real function 
h: [1, . . . , A^] — > [0, ^] into h: [1,...,N] — > A, h = o r o exp{ih) 
where r rounds to the nearest element of the form e^'^^ . Now a continuous 
version T of T applied to h is given by the formula: 



r( E |t>® >) = E Ew^(T(/^)t)|t > . 

\t=\ / x=\ t=l V / 

Note that there is a simple modification of a quantum circuit computing 
T to one computing T. Instead of simply recording the qubit |a; > by adding 
X to a D— state register R holding zero, the trick is to initialize R to the 
superposition: 

w°|l > +w^\2 > + ... w^-^\D > . 

Now adding x to thejibove state simply changes its phase by a factor of w^. 
Thus, the transform T, possessing a natural continuum limit, can be sampled 
in the QCM with no more difficulty than the discrete version T. If quantum 
computers are built, such continuous transforms would find application in 
signal processing. Beyond these obvious applications, it is interesting to 
wonder what algorithmic potential lies in these continuum transformations. 
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